137 research outputs found

Application of regulatory sequence analysis and metabolic network analysis to the interpretation of gene expression data

Author: A. Brazma
A.J. Enright
D. Gilbert
D. Thomas
E. Wingender
E.M. Marcotte
G. Reinert
H. Salgado
J. van Helden
J.H. Graber
J.L. DeRisi
M. Kanehisa
M. Pellegrini
M.B. Eisen
P. Tamayo
P.D. Karp
P.O. Brown
P.T. Spellman
Publication venue: JOBIM
Publication date: 01/01/2000

We present two complementary approaches for interpreting clusters of co-regulated genes, such as those obtained from DNA chips and related methods. Starting from a cluster of genes with similar expression profiles, two basic questions can be asked: 1. Which mechanism is responsible for the coordinated transcriptional response of the genes? This question is approached by extracting motifs that are shared between the upstream sequences of these genes. The extracted motifs are putative cis-acting regulatory elements. 2. What is the physiological meaning, for the cell, of expressing these genes together? One way to answer this question is to search for potential metabolic pathways that could be catalyzed by the products of the genes. This can be done by selecting the genes from the cluster that code for enzymes and trying to assemble the catalyzed reactions into metabolic pathways. We present tools to answer these two questions and illustrate their use with selected examples in the yeast Saccharomyces cerevisiae. The tools are available on the web (http://ucmb.ulb.ac.be/bioinformatics/rsa-tools/; http://www.ebi.ac.uk/research/pfbp/; http://www.soi.city.ac.uk/~msch/).
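
The first question in the abstract above (finding motifs shared between upstream sequences) can be illustrated with a minimal sketch. This is not the method implemented in the tools the abstract links to; it only counts which fixed-length words (k-mers) occur in most of the sequences. The function name `shared_kmers`, its parameters, and the toy sequences are assumptions made for illustration.

```python
from collections import Counter


def shared_kmers(upstream_seqs, k=6, min_fraction=0.8):
    """Return the k-mers present in at least min_fraction of the sequences."""
    presence = Counter()
    for seq in upstream_seqs:
        seq = seq.upper()
        # Count each k-mer once per sequence (presence, not number of occurrences).
        kmers = {seq[i:i + k] for i in range(len(seq) - k + 1)}
        presence.update(kmers)
    threshold = min_fraction * len(upstream_seqs)
    return sorted(kmer for kmer, n in presence.items() if n >= threshold)


# Hypothetical toy upstream regions, invented for this example.
toy_upstream = [
    "TTACACGTGGGA",
    "GGCACGTGTTTA",
    "ACACGTGCCTAA",
]

# k-mers found in every toy sequence.
in_all = shared_kmers(toy_upstream, k=6, min_fraction=1.0)
print(in_all)
```

A real analysis would also compare the observed counts with a background model, since words that are common everywhere in the genome are not informative.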

CiteSeerX

Crossref

DI-fusion

Brunel University Research Archive

The Iterative Signature Algorithm for the analysis of large scale gene expression data

Author: A. Brazma
A. Schulze
C.M. Perou
D.D. Lee
E. Lander
G. Getz
G. Sherlock
J. Ihmels
J.E. Staunton
J.L. DeRisi
Jan Ihmels
L. Lazzeroni
M. Bittner
M. Schena
M.B. Eisen
N.S. Holter
Naama Barkai
O. Alter
P. Tamayo
P.T. Spellman
R.B. Altman
S. Tavazoie
Sven Bergmann
T. Hastie
T.G. Kolda
U. Alon
U. Scherf
Y. Cheng
Publication venue: 'American Physical Society (APS)'
Publication date: 08/10/2002

We present a new approach for the analysis of genome-wide expression data. Our method is designed to overcome the limitations of traditional techniques when applied to large-scale data. Rather than allotting each gene to a single cluster, we assign both genes and conditions to context-dependent and potentially overlapping transcription modules. We provide a rigorous definition of a transcription module as the object to be retrieved from the expression data. We establish an efficient algorithm that searches for the modules encoded in the data by iteratively refining sets of genes and conditions until they match this definition. Each iteration involves a linear map, induced by the normalized expression matrix, followed by the application of a threshold function. We argue that our method is in fact a generalization of Singular Value Decomposition, which corresponds to the special case where no threshold is applied. We show analytically that for noisy expression data our approach leads to better classification due to the implementation of the threshold. This result is confirmed by numerical analyses based on in-silico expression data. We briefly discuss results obtained by applying our algorithm to expression data from the yeast S. cerevisiae. Comment: LaTeX, 36 pages, 8 figures
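
The iteration described in the abstract above (a linear map followed by a threshold, repeated until the gene and condition sets stop changing) can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it uses a single unnormalized matrix, averages as the linear map, and a "mean plus t standard deviations" threshold. The names `isa_sketch` and `threshold` and the toy matrix are hypothetical.

```python
from statistics import mean, pstdev


def threshold(scores, t):
    """Keep the indices whose score exceeds mean + t * standard deviation."""
    mu, sigma = mean(scores), pstdev(scores)
    return {i for i, s in enumerate(scores) if s > mu + t * sigma}


def isa_sketch(E, seed_genes, t_gene=0.5, t_cond=0.5, max_iter=50):
    """Refine a non-empty seed gene set into a (genes, conditions) module."""
    n_genes, n_conds = len(E), len(E[0])
    genes, conds = set(seed_genes), set()
    for _ in range(max_iter):
        # Linear map genes -> conditions: average expression over the gene set.
        cond_scores = [sum(E[g][c] for g in genes) / len(genes)
                       for c in range(n_conds)]
        conds = threshold(cond_scores, t_cond)
        if not conds:
            break
        # Linear map conditions -> genes: average over the selected conditions.
        gene_scores = [sum(E[g][c] for c in conds) / len(conds)
                       for g in range(n_genes)]
        new_genes = threshold(gene_scores, t_gene)
        if not new_genes or new_genes == genes:
            break  # fixed point reached (or nothing passes the threshold)
        genes = new_genes
    return genes, conds


# Hypothetical toy matrix (rows: genes, columns: conditions).
# Genes 0-2 are jointly up-regulated under conditions 0-1.
toy_E = [
    [4.0, 5.0, 0.1, -0.2, 0.0],
    [5.0, 4.5, -0.1, 0.2, 0.1],
    [4.5, 4.0, 0.0, 0.1, -0.1],
    [0.1, -0.1, 0.2, 0.0, 0.1],
    [-0.2, 0.0, 0.1, 0.1, 0.0],
    [0.0, 0.2, -0.1, 0.0, 0.2],
]

module_genes, module_conds = isa_sketch(toy_E, seed_genes={0})
print(sorted(module_genes), sorted(module_conds))
```

Starting from a single seed gene, the sketch recovers the whole planted block; different seeds may converge to different, possibly overlapping modules, which is the behavior the abstract contrasts with single-cluster assignment.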

arXiv.org e-Print Archive

Crossref

Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

Author: B. Hamann
C.C. Fowlkes
C.L. Luengo Hendriks
D.W. Knowles
E.W. Bethel
G.H. Weber
H. Hagen
J. Malik
M.B. Eisen
M.D. Biggin
Min-Yu Huang
O. Rubel
S.V.E. Keranen
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'

Crossref

Analysis of Agglomerative Clustering

Author: A.Z. Broder
Christian Sohler
Daniel Kuntze
F. Pereira
Johannes Blömer
K. Florek
K. Lee
L.L. McQuitty
M. Bădoiu
M. Charikar
M. Fréchet
M. Naszódi
M.B. Eisen
Marcel R. Ackermann
P.H.A. Sneath
R. Webster
S. Dasgupta
T. Feder
T.F. Gonzalez
W.B. Johnson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/03/2014

The diameter $k$-clustering problem is the problem of partitioning a finite subset of $\mathbb{R}^d$ into $k$ subsets called clusters such that the maximum diameter of the clusters is minimized. One early clustering algorithm that computes a hierarchy of approximate solutions to this problem (for all values of $k$) is the agglomerative clustering algorithm with the complete linkage strategy. For decades, this algorithm has been widely used by practitioners. However, it is not well studied theoretically. In this paper, we analyze the agglomerative complete linkage clustering algorithm. Assuming that the dimension $d$ is a constant, we show that for any $k$ the solution computed by this algorithm is an $O(\log k)$-approximation to the diameter $k$-clustering problem. Our analysis does not only hold for the Euclidean distance but for any metric that is based on a norm. Furthermore, we analyze the closely related $k$-center and discrete $k$-center problem. For the corresponding agglomerative algorithms, we deduce an approximation factor of $O(\log k)$ as well. Comment: A preliminary version of this article appeared in Proceedings of the 28th International Symposium on Theoretical Aspects of Computer Science (STACS '11), March 2011, pp. 308-319. This article also appeared in Algorithmica. The final publication is available at http://link.springer.com/article/10.1007/s00453-012-9717-
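
For readers unfamiliar with the algorithm analyzed in the abstract above, here is a minimal sketch of agglomerative clustering with complete linkage, together with the objective of the diameter $k$-clustering problem (the largest cluster diameter). It is a naive cubic-time illustration with the Euclidean distance and does not demonstrate the approximation bound; the function names and toy points are assumptions.

```python
import math
from itertools import combinations


def complete_linkage(points, k):
    """Merge clusters greedily until k remain, using complete linkage."""
    clusters = [[p] for p in points]

    def linkage(a, b):
        # Complete linkage: distance between the two farthest members.
        return max(math.dist(p, q) for p in a for q in b)

    while len(clusters) > k:
        # Pick the pair of clusters whose merge has the smallest linkage value.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # safe because i < j
    return clusters


def max_diameter(clusters):
    """Objective of diameter k-clustering: the largest intra-cluster distance."""
    return max((math.dist(p, q)
                for c in clusters for p, q in combinations(c, 2)),
               default=0.0)


# Hypothetical toy points forming two well-separated groups.
toy_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
toy_clusters = complete_linkage(toy_points, k=2)
print(toy_clusters, max_diameter(toy_clusters))
```

Stopping the merge loop at different values of `k` yields the hierarchy of solutions the abstract refers to.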

arXiv.org e-Print Archive

Crossref

Validating Gene Clusterings by Selecting Informative Gene Ontology Terms with Mutual Information

Author: A. Alexa
A. Schliep
E.I. Boyle
F.D. Gibbons
G. McLachlan
I.G. Costa
L.J. Hubbert
M. Ashburner
M.B. Eisen
M.H. Jia
P. D’haeseleer
P. Westfall
R. Steuer
R.J. Cho
S. Grossmann
T. Beissbarth
T.M. Cover
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007

We propose a method for global validation of gene clusterings. The method selects a set of informative and non-redundant GO terms through an exploration of the Gene Ontology structure guided by mutual information. Our approach yields a global assessment of the clustering quality and a higher-level interpretation of the clusters, as it relates GO terms to specific clusters. We show that on two gene expression data sets our method offers an improvement over previous approaches.
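
The quantity that guides the selection in the abstract above, mutual information between a clustering and a GO term annotation, can be computed for discrete labels as in the sketch below. The exploration of the Gene Ontology structure itself is not reproduced; the function name, the cluster labels, and the two binary annotation vectors are hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in bits) between two equally long label sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # Sum of p(x,y) * log2(p(x,y) / (p(x) * p(y))) over observed pairs.
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())


# Hypothetical data: 8 genes in two clusters, and two binary GO annotations
# (1 = gene is annotated with the term, 0 = it is not).
cluster_labels = [0, 0, 0, 0, 1, 1, 1, 1]
term_informative = [1, 1, 1, 1, 0, 0, 0, 0]    # matches the clustering exactly
term_uninformative = [1, 0, 1, 0, 1, 0, 1, 0]  # independent of the clustering

mi_informative = mutual_information(cluster_labels, term_informative)
mi_uninformative = mutual_information(cluster_labels, term_uninformative)
print(mi_informative, mi_uninformative)
```

A term whose annotation pattern tracks the cluster boundaries scores high, while a term spread evenly across clusters scores near zero.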

CiteSeerX

Crossref

In silico characterization and expression analyses of sugarcane putative sucrose non-fermenting-1 (SNF1) related kinases

Author: Alderson A.
Altschul S.F.
Carling D.
Celenza J.L.
Douglas P.
Eisen M.B.
Estruch F.
Frattini M.
Gancedo J.M.
Graham I.
Halford N.G.
Hannappel U.
Hardie D.G.
Hawley S.A.
Hotta H.
Huang X.
Ikeda Y.
Koch K.E.
Krapp A.
Le Guen L.
Mackintosh R.W.
Muranaka T.
Ohba H.
Purcell P.C.
Simon M.
Smith R.F.
Sugden C.
Takano M.
Telles G. P.
Thompson-Jaeger S.
Vettore A. L.
Wilson W.A.
Woods A.
Publication venue: 'FapUNIFESP (SciELO)'

Crossref

Novel Stress-responsive Genes EMG1 and NOP14 Encode Conserved, Interacting Proteins Required for 40S Ribosome Biogenesis

Author: Ansari-Lari M.A.
Aris J.P.
Baim S.B.
Boeke J.D.
Carter A.P.
Chu S.
Dennis J. Thiele
DeRisi J.L.
Eisen M.B.
Elizabeth Craig
Franzusoff A.
Gautier T.
Gietz D.
Ginisty H.
Girard J.P.
Gorenstein C.
Hadano S.
Hakuno F.
Herruer M.H.
Holstege F.C.
Hughes J.D.
Hughes J.M.
Iouk T.L.
Jansen R.
Kim C.H.
Kondo K.
Kressler D.
Lafontaine D.
Lascaris R.F.
Lee W.C.
Li H.D.
Li Y.
Liu X.D.
Lopez N.
Mager W.H.
Morrissey J.P.
O'Day C.L.
Pederson T.
Phillip C. C. Liu
Planta R.J.
Ross-Macdonald P.
Rout M.P.
Scheer U.
Tollervey D.
Venema J.
Vojtek A.B.
Wach A.
Warner J.R.
Wise J.A.
Wu P.
Zanchin N.I.
Publication venue: 'American Society for Cell Biology (ASCB)'

Crossref

Statistical Mechanics of Horizontal Gene Transfer in Evolutionary Ecology

Author: A. Babic
A. Huppert
A. Kreimer
A. Monier
A. Tsirigos
A.A. Salyers
B. Rodriguez-Brito
B. Snel
C. Adami
C. Brochier
C. Médigue
C. Pál
C. Waters
C.A. Suttle
C.M. Thomas
C.R. Woese
D. Lindell
D. Prangishvili
D.N.L. Menge
E. Beltrami
E. DeLong
E. Denamur
E. Gladyshev
E.S. Anderson
F. Rodriguez-Valera
G. Hallegraeff
G. Hutchinson
G. Tyson
G. Wagner
G.J. McKenzie
H. Jeong
H. Simon
I. Chen
I. Hecht
J. Handelsman
J. He
J. Hotopp
J. Mahillon
J. Pace
J. Sun
J.A. Eisen
J.D. Elsas
J.J. Davis
J.M. Beman
J.P. Gogarten
J.R. de la Torre
K. Farahi
K. Vetsigian
K.M. Nielsen
K.T. Konstantinidis
L. Hartwell
L.S. Frost
M. Lorenz
M. Parter
M. Scheffer
M. Syvanen
M.B. Sullivan
M.G. Weinbauer
M.T.G. Holden
N. Chia
N. Goldenfeld
N. Kashtan
N.G. Anderson
N.U. Frigaard
Nicholas Chia
Nigel Goldenfeld
O. Cohen
P. Wilmes
P.J. Keeling
P.M. Bennett
R. Edwards
R. Thomas
R.C. Dewar
R.E. Mirollo
R.G. Beiko
R.I. Aminov
R.W. Hendrix
S. Garcia-Vallvé
S. Koike
S. Sonea
S.J. Sørensen
S.P. Hubbell
T. Butler
U. Bergthorsson
V. Kunin
V. Schoemann
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/12/2010

The biological world, especially its majority microbial component, is strongly interacting and may be dominated by collective effects. In this review, we provide a brief introduction for statistical physicists to the way in which living cells communicate genetically through transferred genes, as well as the ways in which they can reorganize their genomes in response to environmental pressure. We discuss how genome evolution can be thought of as related to the physical phenomenon of annealing, and describe the sense in which genomes can be said to exhibit an analogue of information entropy. As a direct application of these ideas, we analyze the variation with ocean depth of transposons in marine microbial genomes, predicting trends that are consistent with recent observations from metagenomic surveys. Comment: Accepted by Journal of Statistical Physics

arXiv.org e-Print Archive

Crossref

Clustering Algorithms: Their Application to Gene Expression Data

Author: Agrawal R.
Alizadeh A.A.
Bandyopadhyay S.
Bezdek J.C.
Bhargavi M.S.
Blatt M.
Bochkov Y.A.
Brunet J.P.
Bryan K.
Buitinck L.
Bunnik E.M.
Caliński T.
Chandrasekhar T.
Cheng Y.
Costa I.G.
Cover T.M.
D'haeseleer P.
Dave R.N.
Davies D.L.
De Morsier F.
Dempster A.P.
Dharmarajan A.
Dhillon I.S.
Divina F.
Do C.B.
Domany E.
Du Z.
Dunn J.C.
Edla D.R.
Eisen M.B.
Ferguson T.S.
Frey B.J.
Fu L.
Fukuyama Y.
Galluccio L.
Gath I.
Getz G.
Gordon G.J.
Gu J.
Guha S.
Handhayani T.
Handl J.
Hatamlou A.
Heard N.A.
Heyer L.J.
Hinneburg A.
Hu X.
Hubert L.J.
Jain A.K.
Jiang D.
Jiang H.
Joopudi S.
Kao Y.T.
Karmilasari S.W.
Karypis G.
Kaufman L.
Kerr G.
Kluger Y.
Kohonen T.
Krzanowski W.J.
Leone M.
Lu Y.
Ma'sum M.A.
MacQueen J.
Madeira S.C.
Mann A.K.
Masciari E.
Maulik U.
Milligan G.W.
Mitra S.
Moon T.K.
Moore W.C.
Müllner D.
Nagpal A.
Nasser S.
Neal R.M.
Ng R.T.
Pakhira M.K.
Pal N.R.
Pedregosa F.
Pirim H.
Pitman J.
Prelić A.
Qin Z.S.
Raman S.
Rasmussen C.E.
Rezaee B.
Rezaee M.R.
Ruspini E.H.
Saha S.
Sathishkumar K.
Sheikholeslami G.
Sheng Q.
Sirinukunwattana K.
Sokal R.R.
Sun J.
Talaat A.M.
Tamayo P.
Tanay A.
Tang C.
Thalamuthu A.
Tibshirani R.
Wan M.
Wang L.
Wang W.
Williams G.
Wu J.
Wu K.L.
Wu S.
Xie X.L.
Xu R.
Xu Y.
Yu H.
Zhang D.
Zhang T.
Zhang Y.
Zhang Z.Y.
Zhao L.
Zhong C.
Zitnik M.
Řehůřek R.
Publication venue: 'SAGE Publications'
Publication date: 01/01/2016

Gene expression data hide vital information required to understand the biological processes that take place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data offers a major opportunity to strengthen the understanding of functional genomics. The complexity of biological networks and the number of genes involved make it harder to comprehend and interpret the resulting mass of data, which consists of millions of measurements; these data also exhibit vagueness, imprecision, and noise. Clustering techniques are therefore a first step toward addressing these challenges, and an essential one in the data mining process for revealing natural structures and identifying interesting patterns in the underlying data. The clustering of gene expression data has proven useful for revealing the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. Another benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to gene expression data in order to identify the appropriate clustering technique that will guarantee stability and a high degree of accuracy in the analysis procedure.

Covenant University Repository

Crossref

Directory of Open Access Journals

PubMed Central

In silico differential display of defense-related expressed sequence tags from sugarcane tissues infected with diazotrophic endophytes

Author: Altschul S.F.
Clark D.
Dellagi A.
Delledonne M.
Dong Z.
Durner J.
Eisen M.B.
Eulgem T.
Ewing R.M.
Gonzalez J.E.
Hara K.
Huang X.
James E.K.
Lambais M.R.
Maleck K.
Marcio R. Lambais
McDowell J.M.
Niehaus K.
Olivares F.L.
Ruiz-Lozano J.M.
Smith R.F.
Telles G.P.
Vettore A.L.
Yang P.
Publication venue: 'FapUNIFESP (SciELO)'

Crossref